Optimal Binning for Genomics

Similar resources

Optimal Data-Based Binning for Histograms

Histograms are convenient non-parametric density estimators, which continue to be used ubiquitously. Summary quantities estimated from histogram-based probability density models depend on the choice of the number of bins. In this paper we introduce a straightforward data-based method of determining the optimal number of bins in a uniform bin-width histogram. Using the Bayesian framework, we der...

TANDEM: integrating automated allele binning into genetics and genomics workflows

SUMMARY Computer programs for the statistical analysis of microsatellite data use allele length variation to infer, e.g. population genetic parameters, to detect quantitative trait loci or selective sweeps. However, observed allele lengths are usually inaccurate and may deviate from the expected periodicity of repeats. The common practice of rounding to the nearest whole number frequently resul...

Constant-complexity Stochastic Simulation Algorithm with Optimal Binning

At the molecular level, biochemical processes are governed by random interactions between reactant molecules, and the dynamics of such systems are inherently stochastic. When the copy numbers of reactants are large, a deterministic description is adequate, but when they are small, such systems are often modeled as continuous-time Markov jump processes that can be described by the chemical maste...

How Optimal Is Algebraic Binning Approach: A Case Study of the Turbo-Binning Scheme With Uniform and Nonuniform Sources

This paper investigates the optimality of the binning approach in distributed source coding for both uniform and nonuniform sources. While the algebraic binning scheme is optimal for uniform sources both asymptotically and at finite lengths, it is shown that the optimality holds only asymptotically for nonuniform sources. High-performance turbo codes are used with the binning scheme on several s...

Single-Cell-Genomics-Facilitated Read Binning of Candidate Phylum EM19 Genomes from Geothermal Spring Metagenomes.

The vast majority of microbial life remains uncatalogued due to the inability to cultivate these organisms in the laboratory. This "microbial dark matter" represents a substantial portion of the tree of life and of the populations that contribute to chemical cycling in many ecosystems. In this work, we leveraged an existing single-cell genomic data set representing the candidate bacterial phylu...

Journal

Journal title: IEEE Transactions on Computers

Year: 2019

ISSN: 0018-9340,1557-9956,2326-3814

DOI: 10.1109/tc.2018.2854880